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Abstract 

In this work, we aim to understand the influence of 
the heterogeneity of infection rates on the Susceptible- 
Infected-Susceptible (SIS) epidemic spreading. Em¬ 
ploying the classic SIS model as the benchmark, we 
study the influence of the independently identically dis¬ 
tributed infection rates on the average fraction of in¬ 
fected nodes in the metastable state. The log-normal, 
gamma and a newly designed distributions are con¬ 
sidered for infection rates. We find that, when the 
recovery rate is small, i.e. the epidemic spreads out 
in both homogeneous and heterogeneous cases: 1) the 
heterogeneity of infection rates on average retards the 
virus spreading, and 2) a larger even-order moment of 
the infection rates leads to a smaller average fraction of 
infected nodes, but the odd-order moments contribute 
in the opposite way; when the recovery rate is large, 
i.e. the epidemic may die out or infect a small fraction 
of the population, the heterogeneity of infection rates 
may enhance the probability that the epidemic spreads 
out. Finally, we verify our conclusions via real-world 
networks with their heterogeneous infection rates. Our 
results suggest that, in reality the epidemic spread may 
not be so severe as the classic SIS model indicates, but 
to eliminate the epidemic is probably more difficult. 


1 Introduction 


infected or susceptible at any time t. Each infected 
node infects each of its susceptible neighbors with an 
infection rate /3. The infected node can be recovered 
with a recovery rate 6. Both infection and recovery 
processes are independent Poisson processes. The av¬ 
erage fraction ?/oo of the infected nodes in the meta¬ 
stable state, ranging in [0,1], indicates how severe the 
influence of the virus is: the larger ?/oo is, the more 
severely the network is infected. 


The classic SIS model assumes that the infection rate 
P is the same for all infected-susceptible node pairs and 
so is the recovery rate 6 for all nodes. Most studies 
are focusing on the relationship between the effective 
infection rate r and the average fraction j/oo of infec¬ 
ted nodes or the epidemic threshold in the virus con¬ 
tamination process with homogeneous infection (recov¬ 
ery) rates. However, in reality, neither the contact 
frequency [l3| between a pair of individuals in social 
networks nor the connecting frequency between a pair 
of nodes in computer networks is constant. Infection 
rates can be different from pairs to pairs, thus hetero¬ 
geneous. Many studies on real diseases, such as SARS 
351 and Plasmodium falciparum infection 0 also re¬ 


veal the heterogeneity of infection rates. Furthermore, 
Smith et al. [30| suggest that the distribution of in¬ 


fection rates in different populations may be varied as 
well, and Wang et al. hud that infection rates with 
the log-normal distribution fit best the data of SARS 
in 2003 by applying their model. 


The studies on contagion processes in networks are 
strongly motivated and justified by the anticipated out¬ 
breaks of epidemic diseases in a population and non¬ 
stop threats of cyber security in computer networks 
J20|L The Susce ptib le-infected-susceptible 
[jjz, 10, 18, 29, 31, 0 is one of the most 
widely used models to describe such processes. In the 
continuous-time Markovian SIS model, a node is either 


(SIS) model 


In this paper, we explore the effect of heterogeneous 
infection rates on the average fraction j/oo of infected 
nodes in a systematic way. We propose a SIS model, in 
a network with N nodes, with the homogeneous recov¬ 
ery rate 5 but heterogeneous infection rates Pij (= Pji , 
i = 1, 2,..., N, j = 1, 2,..., N and i ^ j) between node 
i and node j. Similar to the classic homogeneous SIS, 
our SIS model with heterogeneous infection rates is as 
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well a Markovian process where the time for an infec¬ 
ted node i to infect each of its susceptible neighbors j is 
an independent exponential random variable with av¬ 
erage The homogeneous SIS model has the same 
infection rate (3 for all node pairs whereas all the in¬ 
fection rates in our heterogeneous SIS are independent 
and identically distributed (i.i.d.) random variables. 
We study how the distribution of infection rates influ¬ 
ences the average fraction yoo of infected nodes in the 
metastable state. 


A few recent papers 


25-27,33 have taken into 


account either the heterogeneous infection or recovery 
rates. In [27|, we explored the influence of degree-based 


recovery rates on the average fraction of infected nodes 
in the metastable state. Preciado et al. 25, M dis¬ 


cussed how to choose the infection and recovery rates 
from given discrete sets to let the virus die out. Fu et al. 
[l^ studied the epidemic threshold when the infection 
rates depend on the node degrees and Buono et al. Q 
considered a specific distribution of infection rates and 
observed slow epidemic extinction phenomenon. Yang 
and Zhou [36| gave an edge-based mean-field solution 
of the epidemic threshold in regular networks (the de¬ 
grees of all nodes are the same) with i.i.d. heterogen¬ 
eous infection rates (following uniform or power-law 
distribution). 


In this paper, we explore the influence of heterogen¬ 
eous infection rates on the epidemic spreading. In prac¬ 
tice, the number of new infections in a period of time 
can be used to estimate the infection rate, for example, 
[ 2 ^ counts the number of infected people per time in¬ 
terval (daily, weekly, etc.) to indicate the infection 
rate; [9|, illustrating a strategy to estimate the time- 
varying transmission rates for the spread of infection, 
also takes into account the daily distribution of new 
infections. Besides the number of new infections, the 
interacting frequencies between two neighboring nodes 
have also been employed to estimate the infection rate, 
for example, the infection rate has been considered to 
be proportional to the interacting frequency. The aver¬ 
age infection rate obtained in both scenarios has been 
used as the infection rate in the homogeneous epidemic 
model. Our work points out how such assumption of 
homogeneous rates would differ from real-world hetero¬ 
geneous infection rates with respect to their influence 
on the fraction of infected population. We consider sev¬ 
eral representative distributions with the same mean 


but iMher moments tunable, since the influence of the 
mearDJ has been wide ly studied in the homogeneous SIS 
model 


ig 


22 


24l | . To our best knowledge, our work 


is the first to discuss the influence of higher moments 
of the infection rate distribution in epidemic models. 


2 SIS model with heterogeneous 
infection rates 


In this section, we introduce the classic SIS model, ba¬ 
sic network models, the heterogeneous infection rates 
and the simulation settings of the SIS model with het¬ 
erogeneous infection rates on a network. 


2.1 The classic SIS model 

In the continuous-time Markovian SIS model on a net¬ 
work with N nodes, the state of a node at any time t 
is a Bernoulli random variable, where Xi(t) = 0 rep¬ 
resents that node i is susceptible and Xi{t) = 1 that 
node i is infected. Each infected node infects each of 
its susceptible neighbors with an infection rate /3. The 
infected node can recover with a recovery rate 5. Both 
infection and recovery processes are independent Pois¬ 
son processes. The ratio r = j is called the effective 
infection rate. For each effective infection rate r, the 
infection process dies out in any finite network after a 
long enough time, and the corresponding steady state 
is the absorbing state: i.e. the overall healthy state. 
However, if the effective infection rate r is larger than 
the epidemic threshold Tc, the epidemic spreads out 
and there is a non-trivial metastable state, where the 
average fraction yao of infected nodes is non-zero and 
stable during a long time [s^. The average fraction 
2/00 of infected nodes indicates the severity of the over¬ 
all infection. 


2.2 Network models 

Amon g v arious network models, Erdos-Renyi (ER) 
model [Ul is one of the most widely-used and well- 
studied models. In an ER random network with N 
nodes, each pair of nodes are connected with probabil¬ 
ity p independent from every other pair, thus the dis¬ 
tribution of the degree of a random node is binomial: 
Pr[D = k] = — p)^~^~^ and the average 

^The infection rate between any pair of nodes equals to the 
mean in the homogeneous SIS model. 
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degree E[D\ = [N — \)p. For a large N and con¬ 
stant average degree, the degree distribution is Poisson: 
Pr[D = k\= e-^P{Npf/k\. 

Besides the ER model, the network model with a 
scale-free degree distribution (SF model) has always 
been used to describe real-world networks such as the 
Internet 0 and World Wide Web[l|. The degree distri¬ 
bution of SF networks is given by Pr[D = k]r^k~^,k G 
[dmin,dmax], where dmin is the smallest degree, dmax is 
the degree cutoff, and A is the exponent characteriz¬ 
ing the broadness of the distribution [^. In real-word 
networks, the exponent A is usually in the range [2,3], 
thus we confine the exponent A = 2.5 in this paper. 
We further employ the smallest degree dmin = 2, the 
natural degree cutoff dmax = [sl , and the 

size N = 10"^. Hence, the average degree is approxim¬ 
ately 4. As the comparison, we consider the ER net¬ 
works with the size N = 10^ and the average degree 
E[D] = 4. 


2.3 Heterogeneous infection rates 

In this subsection, we introduce three distributions 
of the heterogeneous infection rates. We aim to ex¬ 
plore how the heterogeneous infection rates influence 
the spread of SIS epidemics, particularly we study the 
relationship between the varianc^ (and even higher 
moments) of the heterogeneous infection rates and the 
average fraction j/oo of infected nodes. Hence, we would 
like to choose infection-rate distributions systematic¬ 
ally such that they cover a broad range of distributions 
including those observed in real-world and importantly 
their higher order moments, at least the variances are 
tunable when their means are fixed. 

The nth moment of a distribution with the 

probability density function (PDF) /b(/ 3) is m„ = 
+ 00 

/ /3”/B(/3)d/3. Thus, the first moment mi is just the 

— OO 

mean and the relationship between the second moment 
m 2 and variance Var[B] is Var[B\ = m 2 — mf, where 
the random variable B is the infection rate of a link. 
To eliminate the influence of the mean mi, we fur¬ 
ther define the nth normalized moment , then 

^1 = 1 and the normalized variance v = 1^2 — 

We choose two asymmetric distributions: the log¬ 
normal and gamma distribution, of which we can keep 

^The variance of a random variable is the second central mo¬ 
ment. 


the means unchanged and tune the variances in a 
large range. The log-normal distribution B ^ 
Log-J\f{l3; /t, cr), of which the PDF is, for /3 > 0 




1 /(In^^A 

l3aV^ [ (2a2) ) 


and the nth normalized moment is Vn = exp {——), 
has a power-law tail for a large range of /3 provided a 
is sufficiently large. The log-normal distribution has 
as well been widely observed in real-world, where the 
interaction frequency between nodes is usually con¬ 
sidered as the infection rate between those nodes. One 
example is the infection rates of the co-author network, 
as illustrated in Fig. |8(b)[ Section 5. Moreover, Wang 


et al. [35| find that by employing the log-normal dis¬ 
tributed infection rates, their epidemic model can ac¬ 
curately fit the infection data of 2003 SARS. 


The gamma distribution B ~ F(/3; k, 9), of which the 
PDF is, for /3 > 0 

B 

fB{B\k,e) = exp(--)- 


9'9'^r(k) 

00 

(r(k) = f and the nth normalized moment 

0 

n — l 

is n (1 + has a lighter tail than the log-normal 

i=0 

distribution. The Airline network, as demonstrated in 


Fig. 8(a) has an exponentially distributed infection 
rates, which corresponds to the Gamma distribution 
when fc = I. 


In order to take into account symmetrically dis¬ 
tributed infection rates as well, we design a variance- 
tunable and symmetric distribution other than the two 
asymmetric distributions above. We call it the sym¬ 
metric polynomial (SP) distribution B ^ SP{B',a,b), 
whose PDF is 

/B(/3;a,6) = ^h^|/3_ip 

where /3 G [I — I -I- -^j) and, o = 1 and b G [1, -|-oo) 
or 6 = I and a G [l,-|-oo). The mean of the distri¬ 
bution is I, the variance is Compared to the 

commonly-used uniform distribution (also symmetric 
and variance-tunable) with the same mean, the SP dis¬ 
tribution can be tuned in a larger range of the variance. 
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2.4 The simulations 


3.1 The observations 


In order to study the effect of the variance of the hetero¬ 
geneous infection rates on the virus spread, we perform 
simulations to obtain the fraction j/co of infected nodes 
as a function of the normalized variance v of infection 
rates on both ER and SF networks. We find that, for 
commonly used 2-parameter distributions (such as the 
uniform distribution, log-normal distribution, gamma 
distribution, etc.), the scaling on the mean of infec¬ 
tion rates can be eliminated by the same scaling on 
the recovery rate if we keep the normalized variance v 
unchanged. This conclusion is also consistent with the 
fact that only the effective infection rate j matters for 
the epidemic spreading, but not the infection rate /3 in 
the homogeneous SIS model. Hence, without loss of 
generality, we set the mean mi of the infection rates 
to 1, thus all the normalized moments Vn equal to the 
unnormalized ones mn- Instead of performing discrete¬ 
time simulations, we further develop a continuous-time 
simulator, which was firstly proposed by van de Boven- 
kamp and described in detail in 18|. A discrete-time 
simulation could well approximate a continuous process 
if a small time bin to sample the continuous process 
is selected so that within each time bin, no multiple 
events occur. A heterogeneous SIS model allows dif¬ 
ferent as well large infection or recovery rates, which 
requires even smaller time bin size and challenges the 
precision of a discrete-time simulation. Hence, we im¬ 
plement the precise continuous-time simulations. 


3 Small recovery rates 

In this work, the average of the heterogeneous infec¬ 
tion rates and the homogeneous infect rate are the 
same. Since the recovery rate S plays the key role 
in the epidemic spreading, we discuss our results ac¬ 
cording to different ranges of the recovery rates. In 
this section, we introduce our main results about how 
the heterogeneous infection rates influence the conta¬ 
gion processes of epidemic, when the recovery rates are 
small such that the epidemic spreads out in both ho¬ 
mogeneous and heterogeneous cases. In the next sec¬ 
tion, we focus on large recovery rates - the homogen¬ 
eous effective infection rate t is close to the epidemic 
threshold Tc in the classic model, where the epidemic 
with homogeneous infection rates may die out. 


We first show the simulation results when the variance 
of the infection rates is smaller than 1, since the vari¬ 
ance of a non-negative and symmetric distribution can¬ 
not be larger than the square of its mearj^, thus 1 in 
this paper. 



Fig. 1 : The average fraction ?/oo of infected nodes as a 
function of the normalized variance v of infection rates 
for log-normal (o), gamma (□), and SP ((v) infection- 
rate distributions respectively, and the recovery rate 
6 = 2. We consider ER networks with average degree 
E[D] = 4 and network size N = 10"^. The results are 
averaged over 1000 realizations. 

In Fig. HI we find that the average fraction j/oo of 
infected nodes decreases as the variance v of the infec¬ 
tion rates increases, no matter which distribution the 
infection rates follow. Moreover, the comparison of the 
decay of the three curves in Fig. [T] also suggests that, 
the smaller the third momeniQ of the infection rate 
distribution is, the faster j/co decays as the variance 
increases. 

When the variance v is larger than 1, the infection 
rates cannot be symmetrically distributed. We thus 
discuss only the log-normal and gamma distributions 
which are representative among the heavy-tailed dis¬ 
tributions and widely used in the real-world analysis. 

In Fig.HJ we observe the same as in Fig.[TJ Moreover, 
we find that the average fraction yoo of infected nodes 
decays much faster when infection rates follow gamma 

®For any random variable B following a non-negative and 
symmetric distribution /b(/ 3) with mean mi, the smallest and 
largest value that B can reach is 0 and 2mi respectively, so 
the largest variance, which equals to m^, can be reached when 
Pr[B = 0] = Pr[B = 2mi] = 0.5. 

^The third moment of the log-normal, gamma and SP distri¬ 
bution is (u -|- 1)^, {v -\- l)(2u H- 1) and 3u -|- 1 respectively. 
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Fig. 2 : The average fraction j/oo of infected nodes as 
a function of the variance v of infection rates following 
different distributions: log-normal (o) and gamma (□), 
and the recovery rate 5 = 2. The simulations are on 
ER networks with average degree E\D\ = 4 and net¬ 
work size N = 10^. The results are averaged over 1000 
realizations, and the error bars are the standard devi¬ 
ations of the results in different realizations. The inset 
is the same as the main graph, but in a linear-linear 
scale. 

distributions than log-normal distributions. 

Here we only show the simulation results on ER 
random networks with 10“^ nodes and average degree 
E\D\ = 4, because simulation results on SF networks 
lead to the same observations as illustrated in the Ap¬ 
pendix. Moreover, though not shown in this paper, we 
have also done the simulations with various values of 
the recovery rate, such as 6 = 0.1, 0.2, 1, etc., for both 
ER and SF networks and the conclusions are consist¬ 
ent. 

3.2 The influence of the moments of the 
infection rates 

To explain our observations, we consider a susceptible 

node and an infected node interconnected by a link. 

The probability p{T) that the infected node infects 

the susceptible neighbor in an arbitrary period T, is 
+ 00 

P(^) = I fB{P)F{T]P)d^, where /b(/?) is the PDF 
0 

of the infection rate, and F{T ; /?) is the probability that 
infection occurs between the neighboring infected and 
susceptible node pair within the time interval T when 
the infection rate is /3. Since the infection between any 
infected and susceptible node pair is an independent 
Poisson process, the time for an infected node to infect 


a susceptible neighbor is an exponential variable, i.e. 
F{T\I3) = 1 — We consider further the classic 

homogeneous SIS model, whose infection rate is equal 
to the average infection rate E[B] in our heterogeneous 
SIS model. The counterpart of p(T) in the homogen¬ 
eous SIS model is, then, p*{T) = F{T\E[B]). 


Theorem 1 If fsiP) is the probability density fune- 
tion of a non-negative continuous random variable B, 
and F(T] P) is the distribution funetion of an exponen¬ 
tial random variable with the rate parameter (3, then for 
any T > 0, we have 

OO 

J fBmF{T-p) AP<F{T-E[B]) 

0 

Proof. 


fB{P)F{T-p) d/3 


OO 

=1-/ d/3 


-TBi 


=1 - E[e 

Since the exponential function is convex, Jensen’s in¬ 
equality [32| tells us that 

E[e-^^] > 


Hence, 

OO 

J fB{P)F{T- /3) d/3 < 1 - = FiT- E[B]) 

0 


Theorem 1, that proves p{T) < p*{T), tells us that if 
the infection rate in the classic homogeneous SIS model 
and the average infection rate in heterogeneous model 
are the same, then in the same period of time an infec¬ 
tion event is more likely to happen in the classic SIS 
model. 

We define x(r) = p*{T) — p{T) as the difference in 
infection probability within an arbitrary time interval 
T between the SIS model with homogeneous and het- 
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erogeneous infection rates. 


Hence, 


X{T) =E[e 


-TB^ 


— e 


-TE[B] 


oo 


n—0 




=E 


(m„ - TO^) {-TY 


E 

n—O 
oo 

^E- 

n=0 




< 


\{-TrmB-]\ 


n—O 

oo 

E 

n—O 


= E(^2n-l) 


(rTOi)2" 

(2n)! 




2n+l 


^2E 

n=0 




n\ 


n—O 


(2n + l)! 
(3.1) 


=2e 


Tb 


Note that the first step in (13.ip is valid only if the 
sum ^ converges. The general log¬ 

normal distribution over an infinite range does not sat¬ 
isfy this condition. However, the infection rates of real- 
world systems are finite. Theorem 2 states that any 
realistic distribution of the infection rates within a fi¬ 
nite range satisfies this convergence condition. 


Theorem 2 For any non-negative random variable B 
distributed in a finite range [0, b] and any finite T, the 
sum 


g {-TrEjB-] ^ 


n=0 


thus converges. 


Proof. 


E[B^]= [%^fB{Y)dl3 
Jo 


= / 3 ” / fB{Y)df 


fb fh 

/ / fB(P)dfd/3^ 

Jo Jo 


= /3"Fb(/3)|^- / 


Since 


rh 

Eb{/3) = / fBmf 
Jo 


< 1 


we have 


E[B^] < b" 


EBrndf'-^ 


< 26" 


which illustrates the convergence of - 

for any T. 

■ 

Theorem 1 and dSH) explore only on the local ef¬ 
fect: the epidemic spreads on average faster along a 
link in the heterogeneous case than the homogeneous 
case. However, if the infection probabilities of all the 
nodes are similar and the state of the each node (infec¬ 
ted or not) is independent, each connected node pair 
would have a similar fraction of time when one node 
is infected whereas the other is susceptible, i.e. the 
period that allows epidemic to spread. In this case, 
the difference x(T), where 0 < x(T) < 1, in infec¬ 
tion probability along a link within an arbitrary time 
T may indicate the difference in the fraction of infec¬ 
ted nodes between the homogeneous and heterogeneous 
SIS in the metastable state. Both the heterogeneous in¬ 
fection rates and the heterogeneous network topology 
contribute to the heterogeneity in the infection prob¬ 
ability of each node. When the recovery rate is low or 
equivalently the epidemic prevalence is high, however, 
the infection probabilities of the nodes tend to be sim¬ 
ilar. Hence, x(r) could suggest the difference in the 
fraction of infected nodes between the heterogeneous 
and homogeneous cases when the recovery rate is small. 
The larger the difference x(T) is, the smaller the av¬ 
erage fraction j/oo of infected nodes, in the metastable 
states of the heterogeneous SIS is. Equation (I33D, thus 
suggests that, the larger even-order moments of the in¬ 
fection rates lead to a smaller average fraction of infec¬ 
ted nodes yoo, but the odd-order moments contribute 
in the opposite way. These theoretical results help us 
better understand our two observations in Fig. [T] and 
[21 when the recovery rates are small: (a) the average 
fraction yea of infected nodes decreases with the in¬ 
creased variance, and (b) given the same variance, the 
average fraction yao of infected nodes is lower if the 
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third moment of the distribution is smaller. 

3.3 The log-normal distribution vs. the 
gamma distribution 

To explore how fast yoo decays, we perform simula¬ 
tions with different recovery rates 5 and fit the curves 
of Voo vs. the variance v. We find that, as shown 
in Fig. [3J the relationship between the average frac¬ 
tion 2/00 (r) of infected nodes and the variance v can 
be fitted by a double-exponential function yoc.hiv) = 
+ 036“'^“” and a quadratic function 2/00,r(i’) = 
civ'^ — C 2 V + C3, when the infection rates follow log¬ 
normal and gamma distributions respectively. The 
coefficients ci, C2, C3, and C4, shown in Table im also 
suggest that, approximately, 2/00 ,l decreases exponen¬ 
tially with the variance v much slower than the linear 
decrease of 2/00, r when 2/00, r is not close to 0. 
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Fig. 3 : The average fraction 2/00 of infected nodes as 
a function of the variance v of infection rates following 
gamma distributions. The recovery rates 8 are differ¬ 
ent: 1 (o) and 0.5 (□), and the dash lines are fitting 
curves. The simulations are on ER networks with av¬ 
erage degree (fc) = 4 and network size N = 10'’’. The 
results are averaged over 1000 realizations. The inset 
contains the results about log-normal distributions. 

Table 3.1: The coefficients of the fitting func¬ 
tions of 2/00 vs. the variance v for different 
infection-rate distributions under different re¬ 
covery rates. 


Dist. 

5 

Cl 

C2 

C3 

C4 

Log — N 

3 

0.098 

0.045 

0.099 

0.28 

2 

0.20 

O.OII 

0.21 

0.14 

F 

I 

O.OOII 

0.055 

0.67 

N/A 

0.5 

0.00085 

0.053 

0.83 


Besides the theoretical explanation as mentioned be¬ 
fore, we explore further the physical interpretations of 
the difference in the fraction of infected nodes between 


the log-normal and gamma distributed infection rates. 
We define r(/3) as the ratio between the PDF of the log¬ 
normal and gamma distribution, i.e. r{f3) = ■ 

Thus lim r(/l) = 0 and lim r(/3) = 00. This reveals 
/3—>-0 13 —¥00 

that if we set the same mean and variance (large) for 
both distributions, the log-normal distribution tends 
to generate a few extremely large values whereas the 
gamma distribution generates many extremely small 
values to produce the large variance. 


Table 3.2: The percentiles of the log-normal and 
gamma distribution with the mean mi = 1 and 
variance v = IQ 


Percentiles 

Log — Af 

F 


0.00483 

9.44 X 10-32 

2.5*'“ 

0.00895 

2.20 X 10-25 

5th 

0.0152 

1.44 X 10-2° 

lOth 

0.0280 

9.44 X 10-35 

25‘^ 

0.0779 

2.20 X 10-9 

50tft 

0.243 

1.44 X 10-4 


In Table 13.21 we show the percentile^ of the two 
distributions with a large variance z; = 16. In a group 
of random numbers generated by the gamma distribu¬ 
tion, 25% of them are even smaller than 2.2 x 10“®. 
The infection events driven by such small rates can 
hardly happen. However, in the infection rates gener¬ 
ated by the log-normal distribution, even the first 1% 
smallest values are large enough to make possible infec¬ 
tions. Hence, the gamma distribution effectively filters 
the network more than the log-normal distribution, and 
reduce the spread of the epidemic more. This interpret¬ 
ation is also consistent with the theoretical explanation 
of the influence of the third moment of a distribution. 
The same large variance can be introduced by the log¬ 
normal distribution via the possibility of generating a 
large value and by the gamma distribution via the high 
probability of generating extremely small values. How¬ 
ever, the gamma distribution leads to a smaller third 
moments compared to the log-normal distribution and 
the small infection rates it generates effectively filter 
the network, reducing the epidemic spread. 

percentile is a measure to indicate the value below which a 
given percentage of observations in a group of observations fall. 
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4 Large recovery rates 


We have shown that when the recovery rates are small, 
the i.i.d. heterogeneous infection rates retards the epi¬ 
demic spreading and the larger variance of infection 
rates leads to a smaller average fraction of infected 
nodes. Moreover, we further explained the influence of 
the higher moments of the infection rate on epidemic 
spreading. In this section, we discuss how the hetero¬ 
geneous infection rates influence the epidemic spread¬ 
ing when the recovery rate is large, thus, the epidemic 
is close to die out. As an example, we show the simu¬ 
lation results of the SF networks with the log-normal 
distributed infection rates. We find that, the hetero¬ 
geneous infection rates may increase the probability 
that the epidemic spreads out when the recovery rate 
is large, though if the epidemic can spread out, the 
larger variance of infection rates still leads to a smal¬ 
ler average fraction of infected nodes in the metastable 
state. 

We first employ the log-normal distribution for the 
heterogeneous infection rates and set the recovery rate 
S = 20. As shown in Fig. |4l though the average frac¬ 
tion j/oo of infected nodes is close to 0 (due to the large 
recovery rate), we can observe that the larger variance 
may lead to a slightly larger average fraction y^o of in¬ 
fected nodes. However, the error bars (the standard 
deviation of the simulation results from different real¬ 
izations) are large as compared to the average fraction 
of infected nodes. This is due to the fact that when 
the epidemic is close to die out on average, i.e. when 
(5 = 20, the epidemic dies out in some iterations of the 
simulations but spreads out with a nonzero fraction of 
infected nodes in the metastable state in the others. 

Fig. [5] shows the percentage p* (g [0,1]) of the 
spread-out realizations in all realizations and the av¬ 
erage fraction y’^ of infected nodes in these nonzero- 
infection realizations as a function of the variance of the 
infection rates. Here the the simulations are on SF net¬ 
works with the size V = 10^ and the exponent A = 2.5. 
Clearly, the average fraction of infected nodes obtained 
by averaging that in all realizations is j/oo = P*yto- We 
find that, in all nonzero-infection realizations, the aver¬ 
age fraction of infected nodes still decreases as the 
variance of the infection rates increases. The average 
fraction y^a of infected nodes obtained from all real¬ 
izations may increase as the variance of the infection 
rates increases, because the percentage p* of nonzero- 
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Fig. 4 : The average fraction yoo of infected nodes as 
a function of the variance v of infection rates follow¬ 
ing the log-normal distribution, and the recovery rate 
d = 20. The simulations are on SF networks with the 
exponent A = 2.5 and the network size N = 10^. The 
results are averaged over 1000 realizations, and the er¬ 
ror bars are the standard deviations of the results in 
different realizations. 

infection realizations increases when the variance of the 
infection rates is small and increases. Hence, the het¬ 
erogeneous infection rates may enhance the probability 
that the epidemic spreads out. This can be explained 
as follows: the heterogeneous infection rates and the 
hubs in scale-free networks enable those links with a 
large infection rate to form a connected subgraph, al¬ 
lowing the epidemic to spread out. However, when the 
variance v is large and further increases, as shown in 
Fig. [5l the fraction of non-zero infection realizations 
decreases. This is because, a large variance v of the in¬ 
fection rates produces fewer large infection rates, pro¬ 
hibiting the formation of a connected subgraph with 
high infection rates that allows the epidemic to spread. 
However, the average fraction of infected nodes of the 
nonzero-infection realizations tend to decrease with the 
variance or heterogeneity of the infection rates. 

If we increase the recovery rate to ensure that the 
epidemic dies out in the homogeneous case, i.e. the ef¬ 
fective infection rate is below the epidemic threshold 
Tc in the classic SIS model, we obtain the same con¬ 
clusions: the average fraction of infected nodes 
in nonzero-infection realizations (if exist) always de¬ 
creases as the variance of the infection rates increases, 
and the heterogeneous infection rates may increase the 
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Fig. 5 : The average fraction (o) of infected nodes 
in the nonzero-infection realizations and the percent¬ 
age p* (□) of the nonzero-infection realizations as a 
function of the variance v of infection rates. The in¬ 
fection rates follow the log-normal distribution and the 
recovery rate S = 20. The simulations are on SF net¬ 
works with the exponent A = 2.5 and the network size 
N = 10'^. The results are averaged over 1000 realiza¬ 
tions, and the error bars are the standard deviations of 
the results in different realizations. 

probability that the epidemic spreads out. 

We further compare the simulation results between 
the log-normal and gamma distributions. As shown 
in Fig. |6(a)[ the average fraction of infected nodes 
in nonzero-infection realizations is larger when the in¬ 
fection rates follow the log-normal distribution than 
the gamma distribution. This observation is consist¬ 
ent with our previous observations and conclusions as 
illustrated in Section [H when the variances of the in¬ 
fection rates are the same, the larger third moments 
of the infection rates lead to the more severe infection. 
However, as shown in Fig. |6(b)[ when the variance of 
the infection rates is small, the percentage pp of the 
nonzero-infection realizations is larger in the case of the 
gamma distributed infection rates than the percentage 
Pp of the nonzero-infection realizations in the case of 
the log-normal distributed infection rates. Moreover, 
as the variance of the infection rates is relatively large 
(for example, around 30 in Fig. |6(b)[ ) and increases, 
Pp decreases faster than p^, and Pp could be smaller 
than Pp if the variance is large enough. Given a net¬ 
work and a large recovery rate, more large infection 
rates lead to a higher probability that the epidemic 
can spread out. As in Section 13.31 we can explain the 


observations in Fig. |6(b)] by exploring the percentiles of 
the log-normal and gamma distributions with the mean 
0.5 1 in Table l4Tl where two values (16 and 128) of the 
Q ^ variance are employed as examples. When the variance 
is 16, there are more large values in a group of random 
0 .3"^^umbers generated by the gamma distribution than the 
log-normal distribution; however, when the variance 

0 2 

increases to 128, though the first 1% largest values of 
Q ■\ the gamma distribution are still larger than those of 
the log-normal distribution, there are more large val¬ 
ues in the group of the log-normal random numbers. 
Hence, with the same small variance, the gamma dis¬ 
tributed infection rates contribute more to the survival 
of the epidemic than the log-normal distributed infec¬ 
tion rates, whereas with the same large variance, the 
log-normal distributed infection rates may lead to a 
higher probability that the epidemic spreads out. 

Table 4.1: The percentiles of the log-normal and 
gamma distributions with the mean mi = 1 and 
variance z; = 16 and 128 



Log—M 

u = 16 

r 

n = 16 

Log—Af 

V = 128 

r 

V = 128 

ggth 

12.1936 

19.9409 

15.1178 

24.2306 

ggth 

7.7225 

12.9981 

8.2133 

5.7424 

gjth 

5.7697 

9.3923 

5.6176 

1.5509 

gQth 

4.6209 

7.1783 

4.2063 

0.4140 

95*^* 

3.8751 

5.6325 

3.3260 

0.1507 


We observe the same in ER networks as shown in 
the Appendix. Moreover, the links with i.i.d. large in¬ 
fection rates are more likely to form a subgraph in SF 
networks than in ER networks, because of the exist¬ 
ence of the nodes with large degrees in SF networks. 
Hence, with the similar value of the average fraction y^ 
of infected nodes in the nonzero-infection realizations, 
we find that the percentage p* of the nonzero-infection 
realizations is much smaller in ER networks than SF 
networks. 

We further consider an extreme case of SF net¬ 
works - the star network: one central node no con¬ 
nects with all the other m (m ^ 1) side nodes rii 
(i = 1 , 2 ,..., m), and there is no link between any pair 
of the side nodes. By designing a specific distribu¬ 
tion of the heterogeneous infection rates, we can al¬ 
ways give a value of the recovery rate 6 so that the 
epidemic spreads out with the heterogeneous infection 


9 
































(a) 


rates but dies out with the corresponding homogeneous 
infection rates in a finite-size star network. In the clas¬ 
sic model, the epidemic threshold of a star network is 
Tc = j = [Slj . If we set the homogeneous infection 
rate /3 = 1 and the recovery rate S = -I- e, where e 

is a positive but small constant number, then the epi¬ 
demic dies out. With the same recovery rate, we set 
the heterogeneous infection rate with the distribution 
Pr[B = 2 —ei] = Pr[B = ei] = 0.5, where ei is again a 
small and positive constant number, thus the average 
infection rate E[B] = I. We now look at the subgraph 
which is composed of the central node and approxim¬ 
ately Y side nodes connected to the central node with 
infection rate j3sub = 2 — ei. The effective infection 


rate is Tsub = 


^ 2-ei 

5 


> 


'^CjSUb 5 


V™ ' ^ym/2 

where Tc,sub is the epidemic threshold of the subgraph. 
Hence, with the same recovery rate and the same av¬ 
erage infection rates, the epidemic dies out in the ho¬ 
mogeneous case but spreads out in the aforementioned 
heterogeneous case. 



(b) 

Fig. 6: (a) The average fraction of infected nodes 
in the nonzero-infection realizations and (b) The per¬ 
centage p* of the nonzero-infection realizations in all 
realizations as a function of the variance v of the infec¬ 
tion rates which follow the gamma (o) and log-normal 
(□) distribution. 


5 Real-world networks 

As mentioned in Section [2.31 the interaction frequency 
between two nodes in a real-world network has been 
considered as the infection rate between the pair of 
nodes. In this section, we choose two real-world net¬ 
works as examples to illustrate how their heterogen¬ 
eous infection rates affect the spread of SIS epidemics 
on these networks. The heterogeneous infection rates 
from the datasets are normalized by the average so 
that the average is I. We compare the average frac¬ 
tion of infected nodes in the metastable state of the 
two networks in the 3 scenarios: I) each network is 
equipped with its normalized original heterogeneous 
infection rates (hetero-/3) as given in the dataset; 2) 
each network is equipped with the infection rates in 
the normalized original dataset but randomly shuffled 
(shuffled-/?); 3) each network is equipped with a con¬ 
stant infection rate (homo-/?) which equals to the av¬ 
erage infection rate of the normalized original infec¬ 
tion rates as given in the datasets. The heterogeneous 
infection rates in each network described in Scenario 
1 are possibly correlated. For example, the infection 
rate of a link may depend on the degrees of the two 
ending nodes of this link. The shuffling in Scenario 2 
effectively removes the correlation if it exists, and the 
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infection rates in Scenario 3 are homogeneous as in the 
classic SIS model. Our objective is to explore the re¬ 
lation between the infection rates and average fraction 
of infection in these 3 scenarios for both networks to 
verify our previous findings. 

The first network is the airline network where the 
nodes are the airports, the link between two nodes in¬ 
dicates that there’s at least one flight between these 
two airports, and the infection rate along a link is 
the number of flights between the two airports. We 
construct this network and its infection rates from the 
dataset of openFlightfl The other one is the co-author 
network, where the nodes are the authors of papers, 
the link represents that the two corresponding authors 
have at least one collaborated paper, and the infection 
rate is the collaboration frequency 2l| . 

Besides the infection rates, the network topology 
may as well influence the spread of SIS epidemics. We 
explore the most fundamental network feature of the 
two networks: the degree distributions which are shown 
in Fig. [7l We can see that the degree distributions of 
the airline network and co-author network approxim¬ 
ately follow a power law with the slope A = 1.5 and 2.5 
respectively. Hence, the degree distributions of the two 
networks influence the spread of epidemics in a similar 
way. More details of the two networks are listed in 
Table Em Note that we normalized the infection rates 
of each network by its mean so that the average rate is 
1 . 


Table 5.1: The number of nodes, number of 
links, variance of infection rates and range of 
infection rates in the two networks. 


Name 

Nodes 

Links 

Variance 

Range 

Airline 

3071 

15358 

0.5560 

[0.2383,11.0626] 

Co-author 

39577 

175692 

3.0566 

[0.0678,90.4625] 


The distributions of the infection rates from the two 
networks are shown in Fig. |8(a )]and |8(bj] We find that, 
approximately the infection rates of the airline network 
are exponentially distributed, whereas those of the co¬ 
author network follow a log-normal distribution. Both 
of the two datasets support our previous choices of the 
infection-rate distribution. 
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(a) 



Fig. 7: The degree distributions of the airline (■) and 
co-author network(*) can be approximately fitted by 
the power law distribution with a slope A = 1.5 and 
2.5 respectively. 


®http: //openfiights.org/data.html 


(b) 

Fig. 8: The distribution of the infection rates from 
real-world networks: (a) airline network and (b) co¬ 
author network. In each figure, the distribution (o) 
and fitting curve (dash line) are shown. The fitting 
curves are exponential and log-normal distributions in 
(a) and (b) respectively. 
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5.1 Small recovery rates 


We first consider the small recovery rates, with which 
the epidemic does not die out in any realizations. In 
this paper, we assume that the infection rates are 
i.i.d. which corresponds to Scenario 2. As shown in 
Fig. |9(a)| and |9(b)] the average fraction ?/oo of infected 
nodes in Scenario homo-/3 is always larger than that in 
Scenario shuffled-/?, which confirms our conclusion that 
the heterogeneity of infection rates on average retards 
the contagion processes of epidemics, when recovery 
rates are not very large. Moreover, we find that the 
reduction ?/cx).homo-/3 - 2/cx).shuffled-/3 is larger in the co¬ 
author network, which has a larger variance of infection 
rates, than that in the airline networlj^. This observa¬ 
tion verifies our conclusion that, the larger the variance 
of the infection rates is, the smaller yoo is. Compared 
to the independent infection rates in the case shuffled- 
P, the possibly correlated infection rates in the case 
hetero-,5 can further decrease (in e.g. the airline net¬ 
work) or increase (in e.g. the co-author network) the 
average fraction of infected nodes. This observation 
points out a new challenging question: what is the in¬ 
fluence of such correlated heterogeneous infection rates 
on the SIS epidemics. 

5.2 Large recovery rates 

As shown in Fig. 1101 when the recovery rate increases 
and the effective infection rate is close to the epidemic 
threshold, the average fraction y^o of infected nodes in 
the Scenario hetero-,5 becomes mostly larger than that 
in the other two scenarios. Besides that, it is still con¬ 
sistent with our previous conclusion that if j/oo,homo-,a ^ 
0, then j/oo,homo-/3 > 2/oo,shuffled-/? • Moreover, in the 
co-author network, we observe that when the recov¬ 
ery rate 5 — 40, //oo,shuffled-/? ^ yoo,homo-/? — 0- How¬ 
ever, in the airline network, we cannot observe that 
2/oo.shuffled-/? > 2 / 00 ,homo-/? with any selected recovery 
rate, and this may be because of the small variance of 
the infection rates. Hence, the observations verify our 
conclusions that if the epidemic spreads out with the 
homogeneous infection rates, then the overall infection 
is always more severe than that with the heterogeneous 
infection rates (i.i.d. and with the same mean as the 
homogeneous infection rate); however, the heterogen- 

’'We assume that the two networks have a similar topology, 
since they have a similar degree distribution as shown in Fig. [7| 



(a) 



(b) 

Fig. 9 : The average fraction y^^ of infected nodes as a 
function of the recovery rate 5. The networks are from 
real-world: (a) airline network and (b) co-author net¬ 
work. In each figure, the SIS model with homogeneous 
(o), original heterogeneous (□) and shuffled heterogen¬ 
eous (v) infection rates are compared. 
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eous infection rate may contribute to the survival of 
the epidemic. 



(a) 



(b) 

Fig. 10: The average fraction of infected nodes as 
a function of the recovery rate <5. The recovery rate is 
very large so that the effective infection rate is close to 
the epidemic threshold. The networks are from real- 
world: (a) airline network and (b) co-author network. 
In each figure, the SIS model with homogeneous (o), 
original heterogeneous (□) and shuffled heterogeneous 
(v) infection rates are compared. 

6 Discussions 

In summary, we illustrate with simulations, theoretical 
analysis and physical interpretations that, when the 
recovery rate is small, the heterogeneity of infection 
rates on average retards the virus spread and whereas 
the larger even-order moments of the infection rates 
tend to lead to a smaller y^o, the odd-order moments 
contribute in the other way around; when the recov¬ 


ery rate is large so that the epidemic may die out, the 
heterogeneous infection rates may enhance the prob¬ 
ability that the epidemic spread out. We also verify 
the influence of the heterogeneity of infection rates on 
virus spread in real-world networks. Our work reveals 
that the higher moments, especially the variance, of 
the infection rates may evidently affect the epidemic 
spread, even far more seriously than intuitively expec¬ 
ted. Our finding implies that real-world heterogeneous 
epidemic spread may not be as severe as the classic ho¬ 
mogeneous SIS model predicts, but the heterogeneous 
epidemic may not be as easy as the homogeneous SIS 
model indicates to die out. 

In this work, we have focused on the Markovian SIS 
where the time for an infected node i to infect a sus¬ 
ceptible neighbor j is an exponential random variable 
with rate Pij. Theorem 1 can be extended to Non- 
Markovian SIS models with heterogeneous infection 
rates where the infection time between a neighboring 
infected susceptible node pair (i, j) with average l/Pij 
follows a distribution other than the exponential distri¬ 
bution. Such extension to Non-Markovian SIS models 
is possible if 1 — F{t; jS) the probability that the infec¬ 
tion time is larger than r when the average infection 
time is 1//3 is a convex function of jS. 

The time for an infected node to infect a susceptible 
neighbor is more in depth and detailed information. 
Infection time measurement becomes possible though 
in general is still challenging. For example, in the ex¬ 
periments of the epidemic in the plant population, the 
infection time can be measured. As more such data¬ 
sets become available, it would be interesting to tackle 
a new direction: what is the influence of the hetero¬ 
geneous infection time on viral spreading? 
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(b) 

Fig. 11: The average fraction ?/oo of infected nodes as a 
function of the normalized variance f ((a) z; < 1 and (b) 
z) > 1) of infection rates for log-normal (o), gamma (□), 
and SP ((v) infection rates distribution respectively, 
and the recovery rate 5 = 2. We consider SF networks 
with the exponent A = 2.5 and network size N = 10'^. 
The results are averaged over 1000 realizations. 
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Fig. 12: (a) The average fraction of infected nodes 
in the nonzero-infection realizations and (b) The per¬ 
centage p* of the nonzero-infection realizations in all 
realizations as a function of the variance v of the infec¬ 
tion rates which follow the gamma (o) and log-normal 
(□) distribution. We consider ER networks with the 
average degree E[D] = 4 and network size N = 10"'’. 
The recovery rate is (5 = 3.95. The results are averaged 
over 1000 realizations. 
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